

Chapter 1 


Random Processes 


1.1 Introduction 

Probability theory and stochastic processes have become basic
tools in the design of digital communication systems, including:

• Modeling the sources that generate the information. 

• Digitization of the source output. 

• Characterization of the various communication channels.

• Design of the receiver structure.

• Evaluation of the performance of the whole system. 





1.2 Concepts of Probability 

The theory of probability deals with averages of mass
phenomena occurring sequentially or simultaneously: electron
emission, telephone calls, radar detection, quality control, system
failure, noise, birth and death rates, and queuing theory, among
many others. The purpose of probability theory is to describe
and predict such averages in terms of the probabilities of events.

If an experiment is performed $N$ times and the event $A$
occurs $N_A$ times, then, with a high degree of certainty, the relative
frequency $N_A/N$ of the occurrence of $A$ is close to the probability of
event $A$ when $N$ is large. So the probability of an event $A$ is determined by:

$$P(A) = \frac{N_A}{N} \qquad (1.1)$$

where $N$ is the number of all possible outcomes and $N_A$ is
the number of outcomes that are favorable to the event $A$.

The union A+B of two events A and B is the event that
occurs when A or B or both occur. The intersection, or
equivalently the joint event, A.B of the events A and B is the event that
occurs when both A and B occur. In general, when the events A
and B overlap, the union and joint probabilities
are related by the following expression:

$$P(A + B) = P(A) + P(B) - P(A.B) \qquad (1.2)$$

The events A and B are mutually exclusive if the
occurrence of one of them precludes the occurrence of the other.
In that case they cannot occur jointly, so P(A.B) = 0 and the union is
given simply by:

$$P(A + B) = P(A) + P(B) \qquad (1.3)$$

Note that mutual exclusivity is not the same as independence.






The conditional probability can now be defined as
follows. If the outcomes common to the two events A and B form
their joint event A.B, then the conditional probability of event A, given
event B, is defined (for P(B) > 0) as:

$$P(A/B) = \frac{P(A.B)}{P(B)} \qquad (1.4)$$


One can think of the conditional probability as
representing the likelihood of event A having occurred when it is
known that event B has occurred. However, if the events A and B
are independent, knowing that B has occurred tells us nothing about A, so that:

$$P(A/B) = P(A) \qquad (1.5)$$

Hence, by (1.4), the joint probability of two independent events
equals the product of their individual probabilities:

$$P(A.B) = P(A)\,P(B) \qquad (1.6)$$
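As an illustration of (1.1), (1.2) and (1.4) to (1.6), the following minimal sketch enumerates the outcomes of one roll of a fair die. The events A ("the roll is even") and B ("the roll is at most 2") are hypothetical choices made for this example; they happen to be independent.

```python
from fractions import Fraction

# Sample space of one roll of a fair die; every outcome is equally likely.
OUTCOMES = frozenset(range(1, 7))

def prob(event):
    """Classical probability as in (1.1): favorable outcomes over all outcomes."""
    return Fraction(len(event & OUTCOMES), len(OUTCOMES))

A = frozenset({2, 4, 6})   # hypothetical event: the roll is even
B = frozenset({1, 2})      # hypothetical event: the roll is at most 2

p_union = prob(A | B)              # P(A + B)
p_joint = prob(A & B)              # P(A.B)
p_a_given_b = p_joint / prob(B)    # conditional probability, (1.4)

print(p_union == prob(A) + prob(B) - p_joint)   # checks (1.2)
print(p_a_given_b == prob(A))                   # checks (1.5)
print(p_joint == prob(A) * prob(B))             # checks (1.6)
```

Exact fractions are used so that the identities can be compared with `==` rather than with a floating-point tolerance.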

Finally, we consider a very practical problem. It involves
any experiment for which there are only two possible outcomes on
any trial. Hitting or missing the target in artillery, passing or failing
an exam, receiving a '0' or a '1' in a digital bit stream transmission,
and the occurrence or non-occurrence of any event are just a few
examples.

Assume p is the probability of occurrence of an event A.
The probability of non-occurrence of this event is then denoted
by q, where q = 1 - p.

After repeating the experiment N times, the probability that 
A is observed exactly k times out of N trials, is given by the 
Binomial distribution. This may be represented as follows: 






$$P(A \text{ occurs exactly } k \text{ times}) = \frac{N!}{k!\,(N-k)!}\; p^k (1-p)^{N-k} \qquad (1.7)$$

It is important to note that the binomial distribution
applies both to combined experiments and to repeated trials of a single
experiment.

That is, the k occurrences may be selected from N simultaneously in a
single combined trial, or in N successive repeated trials.
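The binomial probability (1.7) can be evaluated directly. The sketch below is only illustrative: the function name `binomial_pmf` and the scenario (an 8-bit word with a bit-error probability of 0.1) are assumptions made for the example.

```python
from math import comb

def binomial_pmf(k, n_trials, p):
    """Probability that an event of probability p occurs exactly k times in n_trials, as in (1.7)."""
    return comb(n_trials, k) * p**k * (1 - p) ** (n_trials - k)

# Hypothetical scenario: 8 transmitted bits, each in error with probability 0.1.
N_BITS = 8
P_ERROR = 0.1
probs = [binomial_pmf(k, N_BITS, P_ERROR) for k in range(N_BITS + 1)]

for k, value in enumerate(probs):
    print(f"P(exactly {k} errors) = {value:.6f}")
print("sum over all k:", sum(probs))
```

Summing the probabilities over k = 0, ..., N is a quick sanity check, since exactly one of these counts must occur.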

1.3 Random Variables 

A random variable is a number assigned to every outcome
of an experiment. This number could be the voltage of a random
source, the phase of a random signal, the power of a received
signal, or any other numerical quantity that is of interest in the
performance of the experiment.

The theory of random variables is extensive.
Only the main topics used in this chapter are summarized here, so that
the chapter remains useful, readable, and simple without the need to
consult further literature.

A random variable X is characterized by three basic 
functions that allow for ready evaluation of any probabilistic 
question about the random variable. The most fundamental 
function is the cumulative distribution function cdf (or simply 
the distribution function) defined by the following expression: 

$$F_X(x) = P(X \le x) \qquad (1.8)$$

That is, the probability of the event “the random variable
X takes on a value less than or equal to x” in a trial. The
cumulative distribution function is characterized by the
following properties:






• $F_X(-\infty) = 0$ (1.9)

• $F_X(\infty) = 1$ (1.10)

• $0 \le F_X(x) \le 1$ (1.11)

• $F_X(x_1) \le F_X(x_2)$ if $x_1 < x_2$ (1.12)

• $P(x_1 < X \le x_2) = F_X(x_2) - F_X(x_1)$ (1.13)

• $F_X(x^+) = F_X(x)$ (1.14)


For example, the discrete random variable generated by
flipping a fair coin has the cdf shown in Fig. 1.1.a. There are two
jumps in F(x), one at x = -1 and one at x = 1.

Similarly, the random variable generated by tossing a fair
die has the cdf shown in Fig. 1.1.b, with 6 jumps, one at each of
the points x = 1, 2, ..., 6.

Fig. 1.1 Examples of Discrete CDF: (a) Tossing a Coin, (b) Tossing a Die
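The staircase shape of these two cdfs can be reproduced with a few lines of code. This is a minimal sketch; the helper name `discrete_cdf` and the dictionary representation of the probability masses are assumptions of the example, with the coin mapped to the values -1 and 1 as in the text.

```python
from fractions import Fraction

def discrete_cdf(pmf, x):
    """F(x) = P(X <= x) for a discrete variable given as {value: probability}."""
    return sum((p for value, p in pmf.items() if value <= x), Fraction(0))

coin = {-1: Fraction(1, 2), 1: Fraction(1, 2)}        # fair coin mapped to -1 and +1
die = {face: Fraction(1, 6) for face in range(1, 7)}  # fair six-sided die

for x in (-2, -1, 0, 1, 2):
    print("coin F(%g) =" % x, discrete_cdf(coin, x))
for x in (0.5, 1, 3.5, 6):
    print("die  F(%g) =" % x, discrete_cdf(die, x))
```

Because the comparison is `value <= x`, the function takes the upper value at each jump, which matches the right-continuity property (1.14).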




The second important function in the statistics of random
variables is the probability density function (pdf). The
pdf of a random variable X is the
derivative of its cumulative distribution function:

$$f_X(x) = \frac{d}{dx} F_X(x) \qquad (1.15)$$

Since the pdf is the derivative of the cdf, the cdf can in turn be
obtained from the pdf by integration:

$$F_X(x) = \int_{-\infty}^{x} f_X(y)\,dy \qquad (1.16)$$


The probability density function is also characterized
by the following important properties:

$$f_X(x) \ge 0 \quad \text{for all } x \qquad (1.17)$$

$$\int_{-\infty}^{\infty} f_X(x)\,dx = 1 \qquad (1.18)$$

$$F_X(x) = \int_{-\infty}^{x} f_X(y)\,dy \qquad (1.19)$$

$$P(x_1 < X \le x_2) = \int_{x_1}^{x_2} f_X(x)\,dx \qquad (1.20)$$


Finally, the third function, the probability mass function, applies to
discrete random variables.

The mass function of a random variable $X$, denoted
by $p_X(x)$, is defined as follows:

$$p_X(x) = P(X = x) \qquad (1.21)$$






The probability density function given in equation 1.15 is
for continuous random variables. If a random variable is discrete,
taking the values $x_i$ with probability masses given by:

$$P_i = P(X = x_i) \qquad (1.22)$$

then the probability density function can be expressed in
terms of the discrete masses as follows, where $\delta(x)$ is the Dirac delta function:

$$f_X(x) = \sum_i P_i\,\delta(x - x_i) \qquad (1.23)$$


It may be necessary to identify the outcomes of an
experiment by two (or more) random variables. These random
variables may or may not be independent of one another. For
two random variables $X$ and $Y$, the probability that $x < X \le x+dx$
while at the same time $y < Y \le y+dy$ is given by:

$$P(x < X \le x+dx,\; y < Y \le y+dy) = f_{XY}(x,y)\,dx\,dy \qquad (1.24)$$

Extending this to a finite interval leads to:

$$P(x_1 < X \le x_2,\; y_1 < Y \le y_2) = \int_{y_1}^{y_2}\int_{x_1}^{x_2} f_{XY}(x,y)\,dx\,dy \qquad (1.25)$$

The cumulative distribution function of the two random variables is:

$$F_{XY}(x,y) = P(X \le x,\; Y \le y) = \int_{-\infty}^{y}\int_{-\infty}^{x} f_{XY}(u,v)\,du\,dv \qquad (1.26)$$

The cumulative distribution function of $X$ alone,
irrespective of the value of $Y$, is given by:

$$F_X(x) = P(X \le x,\; -\infty < Y < \infty) = \int_{-\infty}^{\infty}\int_{-\infty}^{x} f_{XY}(u,y)\,du\,dy \qquad (1.27)$$






Hence, the probability density function of $X$ is
given as follows:

$$f_X(x) = \frac{d}{dx}F_X(x) = \int_{-\infty}^{\infty} f_{XY}(x,y)\,dy \qquad (1.28)$$

If the random variables $X$ and $Y$ are independent, the
following expressions result:

$$P(x < X \le x+dx,\; y < Y \le y+dy) = [f_X(x)\,dx]\,[f_Y(y)\,dy] \qquad (1.29)$$

$$P(x_1 < X \le x_2,\; y_1 < Y \le y_2) = \left[\int_{x_1}^{x_2} f_X(x)\,dx\right]\left[\int_{y_1}^{y_2} f_Y(y)\,dy\right] \qquad (1.30)$$

$$f_{XY}(x,y) = f_X(x)\,f_Y(y) \qquad (1.31)$$

1.4 Statistical Averages of Random Variables 

Suppose the possible numerical values of the random variable $X$
are $x_1, x_2, x_3, \ldots$, with probabilities of occurrence $P(x_1), P(x_2), P(x_3), \ldots$
As the number of measurements $N$ becomes very
large, the outcome $X = x_1$ is expected to occur $N P(x_1)$ times, so that
the sum of all the measurements is:

$$x_1 P(x_1) N + x_2 P(x_2) N + \ldots = N \sum_n x_n P(x_n) \qquad (1.32)$$

The mean of all these measurements is
called the average value or the expectation of the random variable
$X$, and is calculated by dividing the above sum by $N$:

$$\bar{X} = E[X] = m = \sum_i x_i P(x_i) \qquad (1.33)$$

The average for a continuous random variable is given as 
follows: 






$$\bar{X} = E[X] = m = \int_{-\infty}^{\infty} x f(x)\,dx \qquad (1.34)$$

Furthermore, the average value of a function g(X) of the 
random variable X is given by: 

$$\overline{g(X)} = E[g(X)] = \int_{-\infty}^{\infty} g(x) f(x)\,dx \qquad (1.35)$$

Moreover, if the random variable $X$ is raised to a power $n$,
the average value of $X^n$ is referred to as the $n$-th moment of the
random variable $X$ and is given by:

$$\overline{X^n} = E[X^n] = \int_{-\infty}^{\infty} x^n f(x)\,dx \qquad (1.36)$$

The variance $\sigma^2$ of a random variable is a measure of the
width of the probability density function. It is the second central
moment, that is, the average value of $(X-m)^2$:

$$\sigma^2 = E[(X-m)^2] = \int_{-\infty}^{\infty} (x-m)^2 f(x)\,dx \qquad (1.37)$$

$$\begin{aligned}
\sigma^2 &= E[(X-m)^2] = E[X^2 - 2mX + m^2] \\
&= E[X^2] - 2mE[X] + m^2 \\
&= E[X^2] - 2m^2 + m^2 = E[X^2] - m^2 \\
&= E[X^2] \quad \text{if } m = 0
\end{aligned} \qquad (1.38)$$
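The two expressions for the variance, (1.37) and (1.38), can be checked on a simple discrete case. The following sketch uses a fair die as an assumed example; the helper name `moment` is hypothetical.

```python
from fractions import Fraction

def moment(pmf, n):
    """n-th moment E[X^n] of a discrete variable given as {value: probability}."""
    return sum(Fraction(x) ** n * p for x, p in pmf.items())

die = {face: Fraction(1, 6) for face in range(1, 7)}  # fair six-sided die

m = moment(die, 1)                                           # mean, as in (1.33)
var_central = sum((x - m) ** 2 * p for x, p in die.items())  # E[(X - m)^2], as in (1.37)
var_shortcut = moment(die, 2) - m ** 2                       # E[X^2] - m^2, as in (1.38)

print("mean =", m)
print("variance from (1.37) =", var_central)
print("variance from (1.38) =", var_shortcut)
```

Both routes give the same value, which is the content of the derivation in (1.38).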


1.5 Gaussian Probability Density 

The Gaussian probability density function is of the greatest
importance in communications because many natural phenomena, such as
thermal noise, are characterized by random variables with a Gaussian
density. The Gaussian probability density function is
defined as follows:






$$f(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-(x-m)^2/2\sigma^2} \qquad (1.39)$$

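where $m$ and $\sigma^2$ are the mean and variance of the Gaussian
density, given as follows:

$$m = \int_{-\infty}^{\infty} \frac{x}{\sqrt{2\pi\sigma^2}}\; e^{-(x-m)^2/2\sigma^2}\,dx \qquad (1.40)$$

$$\sigma^2 = \int_{-\infty}^{\infty} \frac{(x-m)^2}{\sqrt{2\pi\sigma^2}}\; e^{-(x-m)^2/2\sigma^2}\,dx \qquad (1.41)$$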


The area under the Gaussian probability density function is unity:

$$\int_{-\infty}^{\infty} f(x)\,dx = 1 \qquad (1.42)$$
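Equations (1.40) to (1.42) can be verified numerically. The sketch below integrates the Gaussian density with a simple midpoint rule over a finite range of ten standard deviations on each side of the mean; the values m = 1.5 and variance 4.0, and the helper names, are arbitrary assumptions for the example.

```python
import math

def gaussian_pdf(x, m, var):
    """Gaussian density of (1.39) with mean m and variance var."""
    return math.exp(-(x - m) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def integrate(func, a, b, steps=20000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    h = (b - a) / steps
    return h * sum(func(a + (i + 0.5) * h) for i in range(steps))

m, var = 1.5, 4.0                        # assumed example parameters
sigma = math.sqrt(var)
lo, hi = m - 10 * sigma, m + 10 * sigma  # the tails beyond this range are negligible

area = integrate(lambda x: gaussian_pdf(x, m, var), lo, hi)
mean = integrate(lambda x: x * gaussian_pdf(x, m, var), lo, hi)
variance = integrate(lambda x: (x - m) ** 2 * gaussian_pdf(x, m, var), lo, hi)

print("area     =", area)       # compare with (1.42)
print("mean     =", mean)       # compare with (1.40)
print("variance =", variance)   # compare with (1.41)
```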


1.6 Error Function 


The cumulative distribution function corresponding to
the Gaussian probability density $f(x)$ with zero average ($m = 0$) is:

$$F(x) = P(X \le x) = \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-y^2/2\sigma^2}\,dy \qquad (1.43)$$


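This integral cannot be evaluated in closed form, but it is readily
available in mathematical tables in terms of the error function. The error
function of $u$, written as erf $u$, is defined as:

$$\operatorname{erf} u = \frac{2}{\sqrt{\pi}} \int_{0}^{u} e^{-z^2}\,dz, \qquad \operatorname{erf}(0) = 0, \quad \operatorname{erf}(\infty) = 1 \qquad (1.44)$$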


The complementary error function, written as erfc u, is defined as: 






$$\operatorname{erfc} u = 1 - \operatorname{erf} u = \frac{2}{\sqrt{\pi}} \int_{u}^{\infty} e^{-z^2}\,dz \qquad (1.45)$$


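So, the cumulative distribution in equation 1.43 may be
expressed in terms of the error function as follows:

$$F(x) = \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-y^2/2\sigma^2}\,dy - \int_{x}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-y^2/2\sigma^2}\,dy$$

The first term is equal to unity. If $u = y/(\sqrt{2}\sigma)$, then:

$$\begin{aligned}
F(x) &= 1 - \int_{x/\sqrt{2}\sigma}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-u^2}\,\sqrt{2}\sigma\,du \\
&= 1 - \frac{1}{2} \int_{x/\sqrt{2}\sigma}^{\infty} \frac{2}{\sqrt{\pi}}\; e^{-u^2}\,du \\
&= 1 - \frac{1}{2} \operatorname{erfc}\!\left(\frac{x}{\sqrt{2}\sigma}\right)
\end{aligned} \qquad (1.46)$$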


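Tabulated values of erfc $u$ are usually available only for positive $u$. For $x < 0$:

$$\begin{aligned}
F(x) = F(-|x|) &= \int_{-\infty}^{-|x|} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-y^2/2\sigma^2}\,dy \\
&= \int_{-\infty}^{-|x|/\sqrt{2}\sigma} \frac{1}{\sqrt{2\pi\sigma^2}}\; e^{-u^2}\,\sqrt{2}\sigma\,du = \frac{1}{\sqrt{\pi}} \int_{-\infty}^{-|x|/\sqrt{2}\sigma} e^{-u^2}\,du
\end{aligned} \qquad (1.47)$$

Letting $v = -u$ yields:

$$F(x) = \frac{1}{2} \cdot \frac{2}{\sqrt{\pi}} \int_{|x|/\sqrt{2}\sigma}^{\infty} e^{-v^2}\,dv = \frac{1}{2} \operatorname{erfc}\!\left(\frac{|x|}{\sqrt{2}\sigma}\right) \qquad (1.48)$$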
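The results (1.46) and (1.48) translate directly into code. The sketch below is an illustration only: the function name `gaussian_cdf` and the value sigma = 2.0 are assumptions, and the comparison uses `statistics.NormalDist` from the Python standard library as an independent reference.

```python
import math
from statistics import NormalDist

def gaussian_cdf(x, sigma):
    """Zero-mean Gaussian cdf written with erfc of a non-negative argument."""
    arg = abs(x) / (math.sqrt(2) * sigma)
    if x >= 0:
        return 1 - 0.5 * math.erfc(arg)   # (1.46), x >= 0
    return 0.5 * math.erfc(arg)           # (1.48), x < 0

SIGMA = 2.0                       # assumed standard deviation for the example
reference = NormalDist(0.0, SIGMA)

for x in (-3.0, -1.0, 0.0, 0.5, 2.0):
    print(f"x = {x:5.1f}  erfc form = {gaussian_cdf(x, SIGMA):.6f}  "
          f"reference = {reference.cdf(x):.6f}")
```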


Now, given a random variable $X$ and its associated density
function $f_X(x)$, what is the density function of the random variable
$Y = g(X)$, where $g(X)$ is some function of $X$?






To find the probability density $f_Y(y)$ for a specific $y$, we
solve the equation $y = g(x)$. If it has $n$ real roots $x_1, x_2, \ldots, x_n$
and $g'(x)$ is the derivative of $g(x)$, then the density function is given
by the following expression:

$$f_Y(y) = \frac{f_X(x_1)}{|g'(x_1)|} + \frac{f_X(x_2)}{|g'(x_2)|} + \ldots + \frac{f_X(x_n)}{|g'(x_n)|} \qquad (1.49)$$


In summary, it is convenient to indicate the following
important cases that concern the density function of $Y$ in terms of
the density function of $X$.


• If $g(x)$ is a linear function of $X$, such as $Y = aX + b$, the
derivative is $g'(x) = a$ and the single root is $x_1 = (y-b)/a$, hence:

$$f_Y(y) = \frac{1}{|a|}\, f_X\!\left(\frac{y-b}{a}\right) \qquad (1.50)$$


• If $g(x)$ is the inverse, so that $Y = 1/X$, the derivative is
$g'(x) = -1/x^2$. Then the density function is given by the following
expression:

$$f_Y(y) = \frac{1}{y^2}\, f_X\!\left(\frac{1}{y}\right) \qquad (1.51)$$


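• If $g(x)$ is a squarer of $X$ (i.e., $Y = aX^2$), the derivative $g'(x)$ is
given by $2ax$, where $a$ is greater than 0. The density of $Y$ is:

$$f_Y(y) = \frac{1}{2a\sqrt{y/a}} \left[ f_X\!\left(\sqrt{y/a}\right) + f_X\!\left(-\sqrt{y/a}\right) \right], \quad y > 0 \qquad (1.52)$$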


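• If $g(x)$ has the sinusoidal form $Y = a \sin(X + \phi)$, the equation
$y = a \sin(x + \phi)$ has, for $|y| < a$, infinitely many solutions $x_n$, with derivatives:

$$g'(x_n) = a \cos(x_n + \phi) = \pm\sqrt{a^2 - y^2} \qquad (1.53)$$

Hence, the density function of $Y$ is given by:

$$f_Y(y) = \frac{1}{\sqrt{a^2 - y^2}} \sum_{n=-\infty}^{\infty} f_X(x_n), \quad |y| < a \qquad (1.54)$$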
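The squarer case (1.52) can be checked against a known result. As an assumed example, take $X$ to be a zero-mean, unit-variance Gaussian variable and $a = 1$; $Y = X^2$ then follows the chi-square density with one degree of freedom, $e^{-y/2}/\sqrt{2\pi y}$. The function names in this sketch are hypothetical.

```python
import math

def std_normal_pdf(x):
    """Density of a zero-mean, unit-variance Gaussian variable."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def density_of_scaled_square(f_x, a, y):
    """f_Y(y) for Y = a*X^2 with a > 0, built from the two roots +/- sqrt(y/a) as in (1.52)."""
    if y <= 0:
        return 0.0
    root = math.sqrt(y / a)
    return (f_x(root) + f_x(-root)) / (2 * a * root)

def chi_square_1_pdf(y):
    """Chi-square density with one degree of freedom, used as a reference."""
    return math.exp(-y / 2) / math.sqrt(2 * math.pi * y)

for y in (0.1, 1.0, 2.5):
    from_formula = density_of_scaled_square(std_normal_pdf, 1.0, y)
    print(f"y = {y}: (1.52) gives {from_formula:.6f}, reference {chi_square_1_pdf(y):.6f}")
```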


1.7 The Central Limit Theorem 

The probability density of a sum of N independent random 
variables tends to approach a Gaussian density as the number N 
increases. The mean and variance of this Gaussian density are 
respectively the sum of the means and the sum of variances of the 
N independent random variables. 

The theorem applies even when the individual random
variables are not Gaussian. In addition, it applies in certain
special cases even when the individual random variables are not
independent.
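A small simulation illustrates the theorem. In this sketch, each sample is the sum of 12 independent variables uniformly distributed on (0, 1), so the sum has mean 12 x 1/2 = 6 and variance 12 x 1/12 = 1. The number of terms, the number of samples and the seed are arbitrary assumptions; the fraction of samples within one standard deviation of the mean is compared with the Gaussian value of about 0.683.

```python
import random
import statistics

def sum_of_uniforms(n_terms, rng):
    """One realization of the sum of n_terms independent uniform(0, 1) variables."""
    return sum(rng.random() for _ in range(n_terms))

rng = random.Random(2024)   # fixed seed so the run is repeatable
N_TERMS = 12                # mean = 6, variance = 1 for the sum
samples = [sum_of_uniforms(N_TERMS, rng) for _ in range(20000)]

sample_mean = statistics.fmean(samples)
sample_var = statistics.pvariance(samples)
within_one_sigma = sum(abs(s - 6.0) <= 1.0 for s in samples) / len(samples)

print("sample mean:", sample_mean)
print("sample variance:", sample_var)
print("fraction within one standard deviation:", within_one_sigma)
```

Although each term is uniform rather than Gaussian, the sum already behaves much like a Gaussian variable with the summed mean and variance.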

1.8 Random Processes 

A random process $X(A, t)$ can be viewed as a function of
two variables: an event $A$ and time $t$. Fig. 1.2 illustrates a random
process with $N$ sample functions of time, $\{X_j(t)\}$. Each of the
sample functions can be regarded as the output of a different noise
generator. For a specific event $A_j$, we have a single time function
$X(A_j, t) = X_j(t)$ (i.e., a sample function). The totality of all
sample functions is called an ensemble.

For a specific time $t_k$, $X(A, t_k)$ is a random variable $X(t_k)$
whose value depends on the event. Finally, for a specific event,
$A = A_j$ and a specific time $t = t_k$, $X(A_j, t_k)$ is simply a number.



For notational convenience we shall designate the random process 
by X(t), and let the functional dependence upon A be implicit. 

1.8.1 Statistical Averages of a Random Process 

Because the value of a random process at any future time
is unknown (since the identity of the event $A$ is unknown), a
random process whose distribution functions are continuous can be
described statistically with a probability density function (pdf). In
general, the form of the pdf of a random process will be different
for different times. In most situations it is not practical to determine
empirically the probability distribution of a random process.





However, a partial description consisting of the mean and
autocorrelation function is often adequate for the needs of
communication systems. We define the mean of the random
process $X(t)$ as:

$$E\{X(t_k)\} = \int_{-\infty}^{\infty} x\, p_{X_k}(x)\,dx = m_X(t_k) \qquad (1.55)$$

where $X(t_k)$ is the random variable obtained by observing
the random process at time $t_k$, and the pdf of $X(t_k)$, the density over
the ensemble of events at time $t_k$, is designated $p_{X_k}(x)$.

We define the autocorrelation function of the random
process $X(t)$ to be a function of two variables, $t_1$ and $t_2$, given by:

$$R_X(t_1, t_2) = E\{X(t_1)X(t_2)\} \qquad (1.56)$$

where $X(t_1)$ and $X(t_2)$ are random variables obtained by
observing $X(t)$ at times $t_1$ and $t_2$, respectively. The
autocorrelation function is a measure of the degree to which two
time samples of the same random process are related.

1.8.2 Stationarity 

A random process X(t) is said to be stationary in the strict 
sense if none of its statistics are affected by a shift in the time 
origin. A random process is said to be wide-sense stationary (WSS) 
if two of its statistics, its mean and autocorrelation function, do not 
vary with a shift in the time origin. Thus, a process is WSS if 

$$E\{X(t)\} = m_X = \text{a constant} \qquad (1.57)$$

$$R_X(t_1, t_2) = R_X(t_1 - t_2) \qquad (1.58)$$


Strict-sense stationarity implies wide-sense stationarity, but
not vice versa. Most of the useful results in communication theory
are predicated on random information signals and noise being
wide-sense stationary. From a practical point of view, it is not
necessary for a random process to be stationary for all time but
only for some observation interval of interest.

For stationary processes, the autocorrelation function in
Equation (1.58) does not depend on time but only on the difference
between $t_1$ and $t_2$. That is, all pairs of values of $X(t)$ at points in
time separated by $\tau = t_1 - t_2$ have the same correlation value.
Thus, for stationary processes, we can denote $R_X(t_1, t_2)$ simply
as $R_X(\tau)$.

1.8.3 Autocorrelation of Wide-Sense Stationary Process 

Just as the variance provides a measure of randomness for
random variables, the autocorrelation function provides a similar
measure for random processes. For a wide-sense stationary
process, the autocorrelation function is only a function of the time
difference $\tau = t_1 - t_2$; that is,

$$R_X(\tau) = E\{X(t)X(t + \tau)\} \quad \text{for } -\infty < \tau < \infty \qquad (1.59)$$

For a zero mean WSS process, $R_X(\tau)$ indicates the extent
to which the random values of the process separated by $\tau$ seconds
in time are statistically correlated. In other words, $R_X(\tau)$ gives us
an idea of the frequency response that is associated with a random
process. If $R_X(\tau)$ changes slowly as $\tau$ increases from zero to some
value, it indicates that, on average, sample values of $X(t)$ taken at
$t = t_1$ and $t = t_1 + \tau$ are nearly the same. Thus, we would expect
a frequency domain representation of $X(t)$ to contain a
preponderance of low frequencies. On the other hand, if $R_X(\tau)$
decreases rapidly as $\tau$ is increased, we would expect $X(t)$ to
change rapidly with time and thereby contain mostly high
frequencies. Properties of the autocorrelation function of a
real-valued wide-sense stationary process are given in Table 1.1.






Table 1.1: Properties of Autocorrelation of a Real-valued
Wide-sense Stationary Process

n   Property                                  Meaning
1   $R_X(\tau) = R_X(-\tau)$                  Symmetrical in $\tau$ about zero
2   $R_X(\tau) \le R_X(0)$ for all $\tau$     Maximum value occurs at the origin
3   $R_X(\tau) \leftrightarrow G_X(f)$        Autocorrelation and power spectral density form a Fourier transform pair
4   $R_X(0) = E\{X^2(t)\}$                    Value at the origin is equal to the average power of the signal


1.8.4 Time Averaging and Ergodicity 

To compute m x and R x (r) by ensemble averaging, we 
would have to average across all the sample functions of the 
process and would need to have complete knowledge of the first- 
and second-order joint probability density functions. Such 
knowledge is generally not available. 

When the random process belongs to a special class, known 
as an ergodic process, its time averages equal its ensemble 
averages, and the statistical properties of the process can be 
determined by time averaging over a single sample function of the 
process. For a random process to be ergodic, it must be stationary 
in the strict sense. An ergodic process is stationary, but, a 
stationary process is not necessarily ergodic. However, for 
communication systems, where we are satisfied to meet the 
conditions of wide-sense stationarity, we are interested only in the 
mean and autocorrelation functions. We can say that a random 
process is ergodic in the mean and autocorrelation if: 

$$m_X = \lim_{T \to \infty} \frac{1}{T} \int_{-T/2}^{T/2} X(t)\,dt \qquad (1.60)$$

$$R_X(\tau) = \lim_{T \to \infty} \frac{1}{T} \int_{-T/2}^{T/2} X(t)X(t + \tau)\,dt \qquad (1.61)$$
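The time averages in (1.60) and (1.61) can be illustrated with a random-phase sinusoid $X(t) = A\cos(2\pi f_0 t + \theta)$, a standard example whose ensemble mean is zero and whose ensemble autocorrelation is $(A^2/2)\cos(2\pi f_0 \tau)$ when $\theta$ is uniform over one cycle. The sketch below takes one sample function (one fixed, assumed value of $\theta$) and averages over a finite window containing a whole number of periods instead of taking the limit; all parameter values are assumptions.

```python
import math

A, F0, THETA = 2.0, 5.0, 0.7   # assumed amplitude, frequency (Hz) and one phase realization

def sample_function(t):
    """One sample function of the random-phase sinusoid."""
    return A * math.cos(2 * math.pi * F0 * t + THETA)

def time_average(func, duration, steps=10000):
    """Midpoint-rule estimate of (1/T) * integral of func over [-T/2, T/2]."""
    h = duration / steps
    return sum(func(-duration / 2 + (i + 0.5) * h) for i in range(steps)) / steps

T = 4.0      # window of 20 full periods at F0 = 5 Hz
TAU = 0.03   # assumed time shift in seconds

mean_estimate = time_average(sample_function, T)
acf_estimate = time_average(lambda t: sample_function(t) * sample_function(t + TAU), T)
acf_ensemble = (A ** 2 / 2) * math.cos(2 * math.pi * F0 * TAU)

print("time-average mean:", mean_estimate)
print("time-average autocorrelation:", acf_estimate)
print("ensemble autocorrelation:", acf_ensemble)
```

The time averages computed from a single sample function agree with the ensemble averages, which is what ergodicity in the mean and autocorrelation requires.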






A reasonable assumption in the analysis of most
communication signals (in the absence of transient effects) is that
the random waveforms are ergodic in the mean and the
autocorrelation function. Since time averages equal ensemble
averages for ergodic processes, fundamental electrical
engineering parameters such as dc value, rms value, and average
power can be related to the moments of an ergodic random process.
Following is a summary of these relationships:

• $m_X = E\{X(t)\}$ is equal to the dc level of the signal.

• $m_X^2$ is equal to the normalized power in the dc component.

• The second moment, $E\{X^2(t)\}$, equals the total average normalized power.

• $\sqrt{E\{X^2(t)\}}$ equals the rms value of the voltage or current signal.

• The variance $\sigma_X^2$ is the average normalized power in the
time-varying or ac component of the signal.

• If the process has zero mean, its variance is the same as the
mean square value, and it represents the total power in the
normalized load.

• The standard deviation $\sigma_X$ is the rms value of the ac component of the signal.

• If $m_X = 0$, then $\sigma_X$ is the rms value of the signal.
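The relationships listed above can be tried on a simple waveform. This sketch assumes an ergodic signal consisting of a 3 V dc level plus a sinusoid of amplitude 2 V, averaged over a whole number of periods; the waveform, its parameters and the helper names are hypothetical.

```python
import math

def waveform(t):
    """Assumed signal: 3 V dc level plus a 50 Hz sinusoid of amplitude 2 V."""
    return 3.0 + 2.0 * math.cos(2 * math.pi * 50.0 * t + 0.3)

def time_average(func, duration, steps=20000):
    """Midpoint-rule estimate of the time average of func over [0, duration]."""
    h = duration / steps
    return sum(func((i + 0.5) * h) for i in range(steps)) / steps

T = 0.2   # ten full periods of the 50 Hz component

dc_level = time_average(waveform, T)                      # mean m_X
total_power = time_average(lambda t: waveform(t) ** 2, T) # second moment E{X^2}
dc_power = dc_level ** 2                                  # m_X squared
ac_power = total_power - dc_power                         # variance
rms_value = math.sqrt(total_power)
ac_rms = math.sqrt(ac_power)                              # standard deviation

print("dc level:", dc_level)
print("total normalized power:", total_power)
print("dc power:", dc_power, " ac power:", ac_power)
print("rms value:", rms_value, " rms of ac component:", ac_rms)
```

The total power splits into the dc power plus the ac power, mirroring the identity between the second moment, the squared mean and the variance in (1.38).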

1.8.5 Power Spectral Density and Autocorrelation of Process 

A random process $X(t)$ can generally be classified as a
power signal having a power spectral density (psd) $G_X(f)$ of the form
shown in Equation (1.20). $G_X(f)$ is particularly useful in
communication systems, because it describes the distribution of a
signal's power in the frequency domain. The psd enables us to
evaluate the signal power that will pass through a network having
known frequency characteristics. We summarize the principal
features of psd functions as follows:






• $G_X(f) \ge 0$ and is always real-valued

• $G_X(f) = G_X(-f)$ for $X(t)$ real-valued

• $G_X(f) \leftrightarrow R_X(\tau)$

• $P_X = \int_{-\infty}^{\infty} G_X(f)\,df$, the relationship between average
normalized power and psd

1.9 Noise 

The term noise is used customarily to designate unwanted
signals that tend to disturb the transmission and processing of
signals in communication systems, and over which we have
incomplete control. In practice, we find that there are many
potential sources of noise in a communication system. The sources
of noise may be external to the system (e.g., atmospheric noise,
galactic noise, man-made noise) or internal to the system.

The second category includes an important type of noise 
that arises from the phenomenon of spontaneous fluctuations of 
current flow that is experienced in all electrical circuits. In a 
physical context, the most common examples of the spontaneous 
fluctuation phenomenon are shot noise, which, as stated in Section 
4.10, arises because of the discrete nature of current flow in 
electronic devices; and thermal noise, which is attributed to the 
random motion of electrons in a conductor. 

However, insofar as the noise analysis of communication 
systems is concerned, be they analog or digital, the analysis is 
customarily based on a source of noise called white-noise, which 
is discussed next. 

1.9.1 White Noise 

This source of noise is idealized, in that its power spectral
density is assumed to be constant and, therefore, independent of
the operating frequency. The adjective “white” is used in the sense
that white light contains equal amounts of all frequencies within
the visible band of electromagnetic radiation. We may thus make
the statement:

White noise, denoted by $W(t)$, is a stationary process whose
power spectral density $S_{WW}(f)$ has a constant value across the
entire frequency interval.

Clearly, white-noise can only be meaningful as an abstract 
mathematical concept; we say so because a constant power spectral 
density corresponds to an unbounded spectral distribution function 
and, therefore, infinite average power, which is physically 
nonrealizable. 

Nevertheless, the utility of white-noise is justified in the 
study of communication theory by virtue of the fact that it is used 
to model channel noise at the front end of a receiver. 

Typically, the receiver includes a filter whose frequency
response is essentially zero outside a frequency band of some finite
value. Consequently, when white noise is applied to the model of
such a receiver, there is no need to describe how the power spectral
density $S_{WW}(f)$ falls off outside the usable frequency band of the
receiver. Let:

$$S_{WW}(f) = \frac{N_0}{2} \qquad (1.62)$$

as illustrated in Fig. 1.3.a. Since the autocorrelation
function is the inverse Fourier transform of the power spectral
density in accordance with the Wiener-Khintchine relations, it
follows that for white noise the autocorrelation function is

$$R_{WW}(\tau) = \frac{N_0}{2}\,\delta(\tau) \qquad (1.63)$$

Hence, the autocorrelation function of white noise consists
of a delta function weighted by the factor $N_0/2$ and occurring at the
time shift $\tau = 0$, as shown in Fig. 1.3.b.





Since $R_{WW}(\tau)$ is zero for $\tau \ne 0$, it follows that any two
different samples of white noise are uncorrelated no matter how
closely together in time those two samples are taken.

If the white noise is also Gaussian, then the two samples 
are statistically independent in accordance with Property 4 of the 
Gaussian process. In a sense, then, white Gaussian noise represents 
the ultimate in “randomness”. 

The utility of a white-noise process in the noise analysis of 
communication systems is parallel to that of an impulse function 
or delta function in the analysis of linear systems. 


Fig. 1.3: Characteristics of White Noise: (a) Power Spectral Density, $S_{WW}(f) = N_0/2$; (b) Autocorrelation Function, $R_{WW}(\tau) = (N_0/2)\,\delta(\tau)$

Just as we may observe the effect of an impulse only after 
it has been passed through a linear system with a finite bandwidth, 
so it is with white noise whose effect is observed only after passing 
through a similar system. We may therefore state: 

As long as the bandwidth of a noise process at the input of 
a system is appreciably larger than the bandwidth of the 
system itself, then we may model the noise process as white 
noise. 

1.9.2 Ideal Low-pass Filtered White Noise 

Suppose that a white Gaussian noise of zero mean and
power spectral density $N_0/2$ is applied to an ideal low-pass filter of
bandwidth $B$ and passband magnitude response of one. The power
spectral density of the noise $N(t)$ appearing at the filter output, as
shown in Fig. 1.4.a, is therefore:

$$S_{NN}(f) = \begin{cases} \dfrac{N_0}{2}, & -B < f < B \\ 0, & |f| > B \end{cases} \qquad (1.64)$$

Since the autocorrelation function is the inverse Fourier
transform of the power spectral density, it follows that:

$$R_{NN}(\tau) = \int_{-B}^{B} \frac{N_0}{2}\, e^{j2\pi f\tau}\,df = N_0 B\,\operatorname{sinc}(2B\tau) \qquad (1.65)$$

whose dependence on $\tau$ is plotted in Fig. 1.4.b. From this
figure, we see that $R_{NN}(\tau)$ has the maximum value $N_0 B$ at the
origin and it passes through zero at $\tau = \pm k/(2B)$, where
$k = 1, 2, 3, \ldots$


Since the input noise $W(t)$ is Gaussian (by hypothesis), it
follows that the band-limited noise $N(t)$ at the filter output is also
Gaussian. Suppose, then, that $N(t)$ is sampled at the rate of $2B$
times per second.

From Fig. 1.4.b, we see that the resulting noise samples are
uncorrelated and, being Gaussian, they are statistically
independent.

Accordingly, the joint probability density function of a set
of noise samples obtained in this way is equal to the product of the
individual probability density functions. Note that each such noise
sample has a mean of zero and variance of $N_0 B$.



Fig. 1.4: Characteristics of Low Pass Filtered White Noise: (a) Power Spectral Density, (b) Autocorrelation Function
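The closed form (1.65) can be compared with a direct numerical inverse Fourier transform of the rectangular spectrum (1.64). The sketch below uses assumed values $N_0 = 4$ and $B = 100$ Hz; the function names are hypothetical, and sinc is taken as $\sin(\pi x)/(\pi x)$.

```python
import math

def sinc(x):
    """Normalized sinc function, sin(pi x) / (pi x), with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)

def acf_closed_form(tau, n0, bandwidth):
    """Autocorrelation of ideal low-pass filtered white noise, as in (1.65)."""
    return n0 * bandwidth * sinc(2 * bandwidth * tau)

def acf_numeric(tau, n0, bandwidth, steps=20000):
    """Midpoint-rule inverse Fourier transform of the rectangular psd (1.64)."""
    h = 2 * bandwidth / steps
    total = 0.0
    for i in range(steps):
        f = -bandwidth + (i + 0.5) * h
        # The imaginary part of exp(j 2 pi f tau) cancels by symmetry in f.
        total += (n0 / 2) * math.cos(2 * math.pi * f * tau)
    return total * h

N0, B = 4.0, 100.0   # assumed example values

print("R(0) =", acf_closed_form(0.0, N0, B), "compared with N0*B =", N0 * B)
for k in (1, 2, 3):
    print(f"R({k}/(2B)) =", acf_closed_form(k / (2 * B), N0, B))
print("numeric vs closed form at tau = 1.3 ms:",
      acf_numeric(1.3e-3, N0, B), acf_closed_form(1.3e-3, N0, B))
```

The values at $\tau = k/(2B)$ are zero up to rounding error, which is why samples taken at the rate $2B$ are uncorrelated.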



1.9.3 Correlation of White Noise with Sinusoidal Wave 

Consider the sample function:

$$w'(t) = \sqrt{\frac{2}{T}} \int_{0}^{T} w(t) \cos(2\pi f_c t)\,dt \qquad (1.66)$$

which is the output of a correlator with a white Gaussian
noise sample function $w(t)$ and the sinusoidal wave
$\sqrt{2/T}\cos(2\pi f_c t)$ as its two inputs; the scaling factor $\sqrt{2/T}$ is
included in (1.66) to make the sinusoidal wave input have unit
energy over the interval $0 \le t \le T$.

With $w(t)$ having zero mean, it immediately follows that
the correlator output $w'(t)$ has zero mean too. The variance of the
correlator output is therefore defined by:

$$\begin{aligned}
\sigma_{W'}^2 &= E\left[ \frac{2}{T} \int_{0}^{T}\!\!\int_{0}^{T} w(t_1)\cos(2\pi f_c t_1)\, w(t_2)\cos(2\pi f_c t_2)\,dt_1\,dt_2 \right] \\
&= \frac{2}{T} \int_{0}^{T}\!\!\int_{0}^{T} E[w(t_1)w(t_2)] \cos(2\pi f_c t_1) \cos(2\pi f_c t_2)\,dt_1\,dt_2 \\
&= \frac{2}{T} \int_{0}^{T}\!\!\int_{0}^{T} \frac{N_0}{2}\,\delta(t_1 - t_2) \cos(2\pi f_c t_1) \cos(2\pi f_c t_2)\,dt_1\,dt_2
\end{aligned} \qquad (1.67)$$

where, in the last line, we made use of (1.63). We now
invoke the sifting property of the delta function, namely:

$$\int_{-\infty}^{\infty} g(t)\,\delta(t)\,dt = g(0) \qquad (1.68)$$

where $g(t)$ is a continuous function of time that has the
value $g(0)$ at time $t = 0$. Hence, we may further simplify the
expression for the noise variance as:

$$\sigma_{W'}^2 = \frac{N_0}{T} \int_{0}^{T} \cos^2(2\pi f_c t)\,dt = \frac{N_0}{2T} \int_{0}^{T} [1 + \cos(4\pi f_c t)]\,dt = \frac{N_0}{2} \qquad (1.69)$$

where, in the last line, it is assumed that the frequency $f_c$ of
the sinusoidal wave input is an integer multiple of the reciprocal of
$T$ for mathematical convenience.
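The role of the integer-multiple assumption can be seen by evaluating the last integral numerically. The sketch below computes $(N_0/T)\int_0^T \cos^2(2\pi f_c t)\,dt$ for a carrier that completes a whole number of cycles in $T$ and for one that does not; the values $N_0 = 2$, $T = 1$ ms and the two carrier frequencies are assumptions chosen for illustration.

```python
import math

def correlator_noise_variance(n0, duration, fc, steps=20000):
    """Midpoint-rule value of (N0/T) * integral over [0, T] of cos^2(2 pi fc t) dt."""
    h = duration / steps
    integral = h * sum(math.cos(2 * math.pi * fc * (i + 0.5) * h) ** 2
                       for i in range(steps))
    return (n0 / duration) * integral

N0, T = 2.0, 1e-3   # assumed noise level and correlation interval

var_integer = correlator_noise_variance(N0, T, fc=5 / T)        # 5 full cycles in T
var_fractional = correlator_noise_variance(N0, T, fc=5.125 / T) # 5.125 cycles in T

print("N0/2 =", N0 / 2)
print("variance, fc an integer multiple of 1/T:", var_integer)
print("variance, fc not an integer multiple of 1/T:", var_fractional)
```

When $f_c$ is an integer multiple of $1/T$ the cosine term in (1.69) integrates to zero and the variance is exactly $N_0/2$; otherwise a small residual term $\sin(4\pi f_c T)/(4\pi f_c T)$ remains.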





1.10 Narrowband Noise 

The receiver of a communication system usually includes
some provision for preprocessing the received signal. Typically,
the preprocessing takes the form of a narrowband filter whose
bandwidth is just large enough to pass the modulated component
of the received signal essentially undistorted, so as to limit the
effect of channel noise passing through the receiver. The noise
process appearing at the output of such a filter is called
narrowband noise. With the spectral components of narrowband
noise concentrated about some midband frequency $\pm f_c$ as in
Fig. 1.5.a, we find that a sample function $n(t)$ of such a process
appears somewhat similar to a sine wave of frequency $f_c$. The
sample function $n(t)$ may, therefore, undulate slowly in both
amplitude and phase, as illustrated in Fig. 1.5.b.

Fig. 1.5: Narrowband Noise: (a) Power Spectral Density, (b) Sample Function

Consider, then, the $n(t)$ produced at the output of a
narrowband filter in response to the sample function $w(t)$ of a white
Gaussian noise process of zero mean and unit power spectral
density applied to the filter input; $w(t)$ and $n(t)$ are sample functions
of the respective processes $W(t)$ and $N(t)$. Let $H(f)$ denote the
transfer function of this filter. Accordingly, we may express the
power spectral density $S_N(f)$ of the noise $N(t)$ in terms of $H(f)$:

$$S_N(f) = |H(f)|^2 \qquad (1.70)$$



On the basis of this equation, we may now make the 
following statement: 

Any narrowband noise encountered in practice may be
modeled by applying a white noise to a suitable filter in the
manner described in (1.70).

In this section we wish to represent the narrowband noise 
n(t) in terms of its in-phase and quadrature components in a manner 
similar to that described for a narrowband signal in Section 2.10. 
The derivation presented here is based on the idea of pre-envelope 
and related concepts, which were discussed in Chapter 2 on Fourier 
analysis of signals and systems. 

Let $n_+(t)$ and $\tilde{n}(t)$, respectively, denote the pre-envelope and
complex envelope of the narrowband noise $n(t)$. We assume that
the power spectrum of $n(t)$ is centered about the frequency $f_c$. Then
we may write:

$$n_+(t) = n(t) + j\hat{n}(t) \qquad (1.71)$$

$$\tilde{n}(t) = n_+(t)\, e^{-j2\pi f_c t} \qquad (1.72)$$

where $\hat{n}(t)$ is the Hilbert transform of $n(t)$. The complex
envelope may itself be expressed as:

$$\tilde{n}(t) = n_I(t) + j n_Q(t) \qquad (1.73)$$

Hence, combining (1.71) through (1.73), we find that the
in-phase component $n_I(t)$ and the quadrature component $n_Q(t)$ of
the narrowband noise $n(t)$ are respectively:

$$n_I(t) = n(t) \cos(2\pi f_c t) + \hat{n}(t) \sin(2\pi f_c t) \qquad (1.74)$$

$$n_Q(t) = \hat{n}(t) \cos(2\pi f_c t) - n(t) \sin(2\pi f_c t) \qquad (1.75)$$

Eliminating $\hat{n}(t)$ between (1.74) and (1.75), we get the desired
canonical form for representing the narrowband noise $n(t)$, as
shown by:

$$n(t) = n_I(t) \cos(2\pi f_c t) - n_Q(t) \sin(2\pi f_c t) \qquad (1.76)$$

Using (1.74) to (1.76), we may now derive some
important properties of the in-phase and quadrature components of
a narrowband noise, as described next.
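The in-phase and quadrature components and the canonical form can be checked on a deterministic stand-in for a narrowband waveform. As an assumed example, take $n(t) = \cos(2\pi(f_c + f_1)t)$, whose Hilbert transform is $\sin(2\pi(f_c + f_1)t)$; the components should then be $\cos(2\pi f_1 t)$ and $\sin(2\pi f_1 t)$. The frequencies and function names below are hypothetical.

```python
import math

FC, F1 = 1000.0, 30.0   # assumed carrier frequency and offset, in Hz

def n(t):
    """Stand-in narrowband waveform: a single tone slightly above the carrier."""
    return math.cos(2 * math.pi * (FC + F1) * t)

def n_hat(t):
    """Hilbert transform of the tone above (cosine becomes sine)."""
    return math.sin(2 * math.pi * (FC + F1) * t)

def in_phase_and_quadrature(t):
    """In-phase and quadrature components built from n(t) and its Hilbert transform."""
    c = math.cos(2 * math.pi * FC * t)
    s = math.sin(2 * math.pi * FC * t)
    n_i = n(t) * c + n_hat(t) * s
    n_q = n_hat(t) * c - n(t) * s
    return n_i, n_q

def canonical_form(t):
    """Rebuild n(t) as n_I cos(2 pi fc t) - n_Q sin(2 pi fc t)."""
    n_i, n_q = in_phase_and_quadrature(t)
    return (n_i * math.cos(2 * math.pi * FC * t)
            - n_q * math.sin(2 * math.pi * FC * t))

for t in (0.0, 1.7e-3, 4.2e-3):
    n_i, n_q = in_phase_and_quadrature(t)
    print(f"t = {t}: n_I = {n_i:.6f}, n_Q = {n_q:.6f}, "
          f"rebuilt n = {canonical_form(t):.6f}, n = {n(t):.6f}")
```

The components vary at the slow offset frequency only, while the carrier terms restore the original waveform.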

First: The in-phase component $n_I(t)$ and quadrature component
$n_Q(t)$ of narrowband noise $n(t)$ have zero mean.

To prove this property, we first observe that the noise $\hat{n}(t)$ is
obtained by passing $n(t)$ through a linear filter (i.e., Hilbert
transformer). Accordingly, $\hat{n}(t)$ will have zero mean because $n(t)$
has zero mean by virtue of its narrowband nature. Furthermore,
from (1.74) and (1.75), we see that $n_I(t)$ and $n_Q(t)$ are weighted
sums of $n(t)$ and $\hat{n}(t)$. It follows, therefore, that the in-phase and
quadrature components, $n_I(t)$ and $n_Q(t)$, both have zero mean.

Second: If the narrowband noise $n(t)$ is Gaussian, then its in-phase
component and quadrature component are jointly Gaussian.

To prove this property, we observe that $\hat{n}(t)$ is derived from
$n(t)$ by a linear filtering operation. Hence, if $n(t)$ is Gaussian, the
Hilbert transform $\hat{n}(t)$ is also Gaussian, and $n(t)$ and $\hat{n}(t)$ are jointly
Gaussian. It follows, therefore, that the in-phase and quadrature
components, $n_I(t)$ and $n_Q(t)$, are jointly Gaussian, since they are
weighted sums of jointly Gaussian processes.

Third: If the narrowband noise is weakly stationary, then its
in-phase component and quadrature component are jointly weakly
stationary.

If $n(t)$ is weakly stationary, so is its Hilbert transform $\hat{n}(t)$.
However, since the in-phase and quadrature components,
$n_I(t)$ and $n_Q(t)$, are both weighted sums of $n(t)$ and $\hat{n}(t)$, and the
weighting functions, $\cos(2\pi f_c t)$ and $\sin(2\pi f_c t)$, vary with time,
we cannot directly assert that $n_I(t)$ and $n_Q(t)$ are weakly stationary. To prove
Property 3, we have to evaluate their correlation functions.






Using (1.74) and (1.75), we find that the in-phase and
quadrature components, $n_I(t)$ and $n_Q(t)$, of a narrowband noise
$n(t)$ have the same autocorrelation function, as shown by:

$$R_{N_I N_I}(\tau) = R_{N_Q N_Q}(\tau) = R_{NN}(\tau) \cos(2\pi f_c \tau) + \hat{R}_{NN}(\tau) \sin(2\pi f_c \tau) \qquad (1.77)$$

$$R_{N_I N_Q}(\tau) = -R_{N_Q N_I}(\tau) = R_{NN}(\tau) \sin(2\pi f_c \tau) - \hat{R}_{NN}(\tau) \cos(2\pi f_c \tau) \qquad (1.78)$$

where $R_{NN}(\tau)$ is the autocorrelation function of $n(t)$, and
$\hat{R}_{NN}(\tau)$ is its Hilbert transform. From (1.77) and (1.78), we readily
see that the correlation functions $R_{N_I N_I}(\tau)$, $R_{N_Q N_Q}(\tau)$, and
$R_{N_I N_Q}(\tau)$ of the in-phase and quadrature components depend
only on the time shift $\tau$. This dependence, in conjunction with
Property 1, proves that $n_I(t)$ and $n_Q(t)$ are weakly stationary if the
original narrowband noise is weakly stationary.


Fourth: Both the in-phase noise $n_I(t)$ and quadrature noise $n_Q(t)$
have the same power spectral density, which is related to the power
spectral density $S_{NN}(f)$ of the original narrowband noise as
follows:

$$S_{N_I N_I}(f) = S_{N_Q N_Q}(f) = \begin{cases} S_{NN}(f - f_c) + S_{NN}(f + f_c), & -B \le f \le B \\ 0, & \text{elsewhere} \end{cases} \qquad (1.79)$$

where it is assumed that $S_{NN}(f)$ occupies the frequency
interval $f_c - B \le |f| \le f_c + B$ and $f_c > B$.

Fifth: The in-phase and quadrature components $n_I(t)$ and $n_Q(t)$
have the same variance as the narrowband noise $n(t)$.

Sixth: The cross-spectral densities of the in-phase and quadrature
components of a narrowband noise are purely imaginary, as
shown by:

$$S_{N_I N_Q}(f) = -S_{N_Q N_I}(f) = \begin{cases} j\,[S_{NN}(f + f_c) - S_{NN}(f - f_c)], & -B \le f \le B \\ 0, & \text{otherwise} \end{cases} \qquad (1.80)$$






Seventh: If a narrowband noise $n(t)$ is Gaussian with zero mean
and a power spectral density $S_{NN}(f)$ that is locally symmetric about
the midband frequency $\pm f_c$, then the in-phase noise $n_I(t)$ and the
quadrature noise $n_Q(t)$ are statistically independent.
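The fourth to seventh properties can be illustrated with an idealized spectrum. The sketch below assumes a narrowband noise whose power spectral density is flat, equal to $N_0/2$, over $f_c - B \le |f| \le f_c + B$ and zero elsewhere; the numerical values ($N_0 = 1$, $f_c = 10$, $B = 1$, in normalized units) and the function names are hypothetical.

```python
N0, FC, B = 1.0, 10.0, 1.0   # assumed normalized values, with FC > B

def s_nn(f):
    """Assumed flat bandpass psd of the narrowband noise."""
    return N0 / 2 if FC - B <= abs(f) <= FC + B else 0.0

def s_in_phase(f):
    """psd of the in-phase (and quadrature) component: shifted copies summed over |f| <= B."""
    if abs(f) > B:
        return 0.0
    return s_nn(f - FC) + s_nn(f + FC)

def s_cross(f):
    """Cross-spectral density of the in-phase and quadrature components."""
    if abs(f) > B:
        return 0j
    return 1j * (s_nn(f + FC) - s_nn(f - FC))

def integrate(func, lo, hi, steps):
    """Midpoint-rule integral of func over [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(func(lo + (i + 0.5) * h) for i in range(steps))

power_narrowband = integrate(s_nn, -(FC + B), FC + B, 22000)
power_in_phase = integrate(s_in_phase, -B, B, 2000)

print("in-phase psd at f = 0:", s_in_phase(0.0))
print("variance of n(t):", power_narrowband)
print("variance of n_I(t):", power_in_phase)
print("cross-spectral density at f = 0.5:", s_cross(0.5))
```

For this locally symmetric spectrum the two variances agree and the cross-spectral density vanishes, so the in-phase and quadrature components are uncorrelated, consistent with the seventh property when the noise is Gaussian.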




